Markov decision process

Results: 537



#Item
21On Structural Properties of MDPs that Bound Loss due to Shallow Planning 1 Nan Jiang1 and Satinder Singh1 and Ambuj Tewari2 Computer Science and Engineering, University of Michigan 2

On Structural Properties of MDPs that Bound Loss due to Shallow Planning 1 Nan Jiang1 and Satinder Singh1 and Ambuj Tewari2 Computer Science and Engineering, University of Michigan 2

Add to Reading List

Source URL: dept.stat.lsa.umich.edu

Language: English - Date: 2016-04-20 13:16:34
22Journal of Artificial Intelligence Research–1178  Submitted 12/15; publishedExploiting Causality for Selective Belief Filtering in Dynamic Bayesian Networks

Journal of Artificial Intelligence Research–1178 Submitted 12/15; publishedExploiting Causality for Selective Belief Filtering in Dynamic Bayesian Networks

Add to Reading List

Source URL: jair.org

Language: English - Date: 2016-04-28 15:06:13
23Sequential Decision Making in Repeated Coalition Formation under Uncertainty Georgios Chalkiadakis Craig Boutilier

Sequential Decision Making in Repeated Coalition Formation under Uncertainty Georgios Chalkiadakis Craig Boutilier

Add to Reading List

Source URL: www.intelligence.tuc.gr

Language: English - Date: 2008-02-08 15:14:59
24Bias in Natural Actor-Critic Algorithms  Philip S. Thomas  Department of Computer Science, University of Massachusetts, Amherst, MAUSA

Bias in Natural Actor-Critic Algorithms Philip S. Thomas Department of Computer Science, University of Massachusetts, Amherst, MAUSA

Add to Reading List

Source URL: psthomas.com

Language: English - Date: 2012-10-01 18:27:53
25Verification of Markov Decision Processes using Learning Algorithms? Tom´asˇ Br´azdil1 , Krishnendu Chatterjee2 , Martin Chmel´ık2 , Vojtˇech Forejt3 , Jan Kˇret´ınsk´y2 , Marta Kwiatkowska3 , David Parker4 , a

Verification of Markov Decision Processes using Learning Algorithms? Tom´asˇ Br´azdil1 , Krishnendu Chatterjee2 , Martin Chmel´ık2 , Vojtˇech Forejt3 , Jan Kˇret´ınsk´y2 , Marta Kwiatkowska3 , David Parker4 , a

Add to Reading List

Source URL: www.hieratic.eu

Language: English
26Online Development of Assistive Robot Behaviors for Collaborative Manipulation and Human-Robot Teamwork Bradley Hayes and Brian Scassellati Dept. of Computer Science, Yale University Human-robot teaming has the potential

Online Development of Assistive Robot Behaviors for Collaborative Manipulation and Human-Robot Teamwork Bradley Hayes and Brian Scassellati Dept. of Computer Science, Yale University Human-robot teaming has the potential

Add to Reading List

Source URL: bradhayes.info

Language: English - Date: 2016-07-11 15:51:46
27Language Understanding for Text-based Games using Deep Reinforcement Learning Karthik Narasimhan∗ CSAIL, MIT

Language Understanding for Text-based Games using Deep Reinforcement Learning Karthik Narasimhan∗ CSAIL, MIT

Add to Reading List

Source URL: arxiv.org

Language: English - Date: 2015-09-14 21:25:04
28Coordination in Multiagent Reinforcement Learning: A Bayesian Approach Georgios Chalkiadakis Craig Boutilier

Coordination in Multiagent Reinforcement Learning: A Bayesian Approach Georgios Chalkiadakis Craig Boutilier

Add to Reading List

Source URL: www.intelligence.tuc.gr

Language: English - Date: 2009-03-02 16:24:03
29Sutton, Richard  PIN

Sutton, Richard PIN

Add to Reading List

Source URL: webdocs.cs.ualberta.ca

Language: English - Date: 2013-10-18 16:05:54
30Learning from Demonstrations: Is It Worth Estimating a Reward Function? Bilal Piot1,2 , Matthieu Geist1 , Olivier Pietquin1,2 1  Supélec, IMS-MaLIS Research group, France

Learning from Demonstrations: Is It Worth Estimating a Reward Function? Bilal Piot1,2 , Matthieu Geist1 , Olivier Pietquin1,2 1 Supélec, IMS-MaLIS Research group, France

Add to Reading List

Source URL: www.ilhaire.eu

Language: English - Date: 2013-10-03 05:33:46